Cost-Time Sensitive Decision Tree with Missing Values
Authors
Abstract
Cost-sensitive decision tree learning is important and popular in the machine learning and data mining communities. Much of the existing literature focuses on misclassification cost and test cost. In real-world applications, however, time sensitivity should also be considered in cost-sensitive learning. In this paper, we treat the cost of time sensitivity in cost-sensitive learning as a waiting cost (WC) and propose a novel splitting criterion for constructing a cost-time sensitive (CTS) decision tree that maximally decreases the intangible cost. We then adopt a hybrid test strategy that combines the sequential and batch test strategies in CTS learning. Finally, extensive experiments show that our algorithm outperforms the others with respect to the decrease in misclassification cost.
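The abstract does not state the splitting criterion itself, so the following is only a minimal sketch of the general idea, assuming a binary class label and a simple additive cost model: a split is scored by how much it reduces misclassification cost, minus the test cost and waiting cost paid for every example that takes the test. The function names, the `(test_cost, waiting_cost)` layout, and the cost model are all hypothetical and are not taken from the paper.

```python
from collections import Counter

def min_misclassification_cost(labels, fp_cost, fn_cost):
    """Cost of labelling every example in a node with the cheaper class."""
    counts = Counter(labels)
    pos, neg = counts.get(1, 0), counts.get(0, 0)
    # Predicting positive costs fp_cost per negative example;
    # predicting negative costs fn_cost per positive example.
    return min(neg * fp_cost, pos * fn_cost)

def split_gain(rows, labels, attr, test_cost, waiting_cost, fp_cost, fn_cost):
    """Hypothetical criterion: cost reduction minus test and waiting overhead."""
    before = min_misclassification_cost(labels, fp_cost, fn_cost)
    groups = {}
    for row, y in zip(rows, labels):
        groups.setdefault(row[attr], []).append(y)
    after = sum(min_misclassification_cost(g, fp_cost, fn_cost)
                for g in groups.values())
    # Assumption: every example pays the test cost and the waiting cost once.
    overhead = len(labels) * (test_cost + waiting_cost)
    return before - after - overhead

def best_attribute(rows, labels, costs, fp_cost, fn_cost):
    """Return (attribute, gain), or (None, 0.0) if no split pays for itself.

    `costs` maps attribute name -> (test_cost, waiting_cost).
    """
    gains = {a: split_gain(rows, labels, a, tc, wc, fp_cost, fn_cost)
             for a, (tc, wc) in costs.items()}
    best = max(gains, key=gains.get)
    return (best, gains[best]) if gains[best] > 0 else (None, 0.0)
```

Under this sketch, an attribute that separates the classes well can still be rejected when its waiting cost is high, which is the kind of trade-off the abstract describes for time-sensitive settings.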
Similar resources
"Missing is Useful": Missing Values in Cost-sensitive Decision Trees
Many real-world datasets for machine learning and data mining contain missing values, and much previous research regards it as a problem, and attempts to impute missing values before training and testing. In this paper, we study this issue in cost-sensitive learning that considers both test costs and misclassification costs. If some attributes (tests) are too expensive in obtaining their values...
Test cost and misclassification cost trade-off using reframing
Many solutions to cost-sensitive classification (and regression) rely on some or all of the following assumptions: we have complete knowledge about the cost context at training time, we can easily re-train whenever the cost context changes, and we have technique-specific methods (such as cost-sensitive decision trees) that can take advantage of that information. In this paper we address the pro...
Model Reframing by Feature Context Change
Many solutions to cost-sensitive classification (and regression) rely on some or all of the following assumptions: we have complete knowledge about the cost context at training time, we can easily re-train whenever the cost context changes, and we have technique-specific methods (such as cost-sensitive decision trees) that can take advantage of that information. In this work we address the prob...
Identification of the most important factors of ethnic differences in anthropometric dimensions of Iranian workers using the decision tree
Background and aims: Anthropometry is the branch of human science that considers the physical measurement of the human body, especially size and shape. One application of anthropometrical data in ergonomics is the design of working space and the development of industrialized products. So that the tools, equipment and workstations, which designed based on the physical dimensions of the workers, ...
VFDT Algorithm for Decision Tree Generation
The purpose of data classification is to construct a classification model. The decision tree algorithm is a more general data classification function approximation algorithm based on machine learning. The decision tree is directed and acyclic. The Iterative Dichotomiser 3 (ID3) algorithm, invented by Ross Quinlan, is used to generate a decision tree from a dataset. Considering its limitations layer an o...